Activity Duration Prediction of Workflows by using a Data Science Approach: Unveiling the Advantage of Semantics
نویسندگان
چکیده
Organizations often have to face a dynamic market environment. Processes must be frequently adapted in order to stay competitive and allow an efficient workflow. Data Science approaches are currently often used in analysis methods to identify influential indicators on processes and learn predictive models to estimate the duration of an activity. However, current methods do not or only partially make use of semantic information in process analysis. The results are unprecise or incomplete, because not all influential indicators have been unveiled and therefore used in the predictive models. We want to make use of the semantics and show the advantage by applying them on existing data science methods for predicting the duration of an activity in a process. Therefore, we 1) enrich process data with metainformation and background knowledge 2) extend existing data science methods so that they include semantic information in their analysis and 3) apply data science methods for predicting values and compare the results with methods, which do not use semantics.
منابع مشابه
Dynamic Categorization of Semantics of Fashion Language: A Memetic Approach
Categories are not invariant. This paper attempts to explore the dynamic nature of semantic category, in particular, that of fashion language, based on the cognitive theory of Dawkins’ memetics, a new theory of cultural evolution. Semantic attributes of linguistic memes decrease or proliferate in replication and spreading, which involves a dynamic development of semantic category. More specific...
متن کاملDynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture
Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...
متن کاملSurface Pressure Contour Prediction Using a GRNN Algorithm
A new approach based on a Generalized Regression Neural Network (GRNN) has been proposed to predict the planform surface pressure field on a wing-tail combination in low subsonic flow. Extensive wind tunnel results were used for training the network and verification of the values predicted by this approach. GRNN has been trained by the aforementioned experimental data and subsequently was used ...
متن کاملTown trip forecasting based on data mining techniques
In this paper, a data mining approach is proposed for duration prediction of the town trips (travel time) in New York City. In this regard, at first, two novel approaches, including a mathematical and a statistical approach, are proposed for grouping categorical variables with a huge number of levels. The proposed approaches work based on the cost matrix generated by repetitive post-hoc tests f...
متن کاملArgos: a framework for automatically generating data processing workflows
Demo. We demonstrate Argos, a framework to automatically generate data processing workflows. First, we show how to assign formal semantics to data and operations using to a domain ontology. Specifically, we define data contents using relational descriptions in an expressive logic. Second, we show a novel planner that uses relational subsumption to connect the output of a data processing operati...
متن کامل